首页> 外文OA文献 >Leukemia and small round blue-cell tumor cancer detection using\ud microarray gene expression data set: Combining data dimension\ud reduction and variable selection technique
【2h】

Leukemia and small round blue-cell tumor cancer detection using\ud microarray gene expression data set: Combining data dimension\ud reduction and variable selection technique

机译:使用\ ud检测白血病和小圆形蓝细胞肿瘤癌 基因芯片基因表达数据集:组合数据维\ ud 约简和变量选择技术

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Using gene expression data in cancer classification plays an important role for solving the fundamental problems\udrelating to cancer diagnosis. Because of high throughput of gene expression data for healthy and patient samples,\uda variable selection method can be applied to reduce complexity of the model and improve the classification\udperformance. Since variable selection procedures pose a risk of over-fitting, when a large number of variables\udwith respect to sample are used,we have proposed a method for coupling data dimension reduction and variable\udselection in the present study. This approach uses the concept of variable clustering for the original data set.\udSignificant components of local principal component analysis models have just been retained from all clusters.\udThen, the variable selection algorithm is performed on these locally derived principal component variables.\udThe proposed algorithm has been evaluated on two gene expression data sets; namely, acute Leukemia and\udsmall round blue-cell tumor (SRBCT). Our results confirmed that the classification models achieved on the\udreduced data were better than those obtained on the entire microarray gene expression profile.
机译:在癌症分类中使用基因表达数据对于解决与癌症诊断有关的基本问题具有重要作用。由于用于健康和患者样品的基因表达数据的通量很高,因此可以采用\ uda变量选择方法来降低模型的复杂性并提高分类\ ud表现。由于变量选择程序存在过度拟合的风险,因此,当使用相对于样本的大量变量\ ud时,我们在本研究中提出了一种将数据降维与变量\ udselect耦合的方法。这种方法对原始数据集使用了变量聚类的概念。\ ud本地主成分分析模型的重要成分刚刚从所有聚类中保留。\ ud然后,对这些本地派生的主成分变量执行变量选择算法。该算法已在两个基因表达数据集上进行了评估;即急性白血病和小圆形蓝细胞肿瘤(SRBCT)。我们的结果证实,在\减少的数据上获得的分类模型比在整个微阵列基因表达谱上获得的分类模型更好。

著录项

相似文献

  • 外文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号